Why your model parameter confidences might be too optimistic – unbiased estimation of the inverse covariance matrix
Abstract
Aims. The maximum-likelihood method is the standard approach to obtain model fits to observational data and the corresponding confidence regions. We investigate possible sources of bias in the log-likelihood function and its subsequent analysis, focusing on estimators of the inverse covariance matrix. Furthermore, we study under which circumstances the estimated covariance matrix is invertible.
Methods. We perform Monte-Carlo simulations to investigate the behaviour of estimators for the inverse covariance matrix, depending on the number of independent data sets and the number of variables of the data vectors.
Results. We find that the inverse of the maximum-likelihood estimator of the covariance is biased, the amount of bias depending on the ratio of the number of bins (data vector variables), p, to the number of data sets, n. This bias inevitably leads to an – in extreme cases catastrophic – underestimation of the size of confidence regions. We report on a method to remove this bias for the idealised case of Gaussian noise and statistically independent data vectors. Moreover, we demonstrate that marginalisation over parameters introduces a bias into the marginalised log-likelihood function. Measures of the sizes of confidence regions suffer from the same problem. Furthermore, we give an analytic proof for the fact that the estimated covariance matrix is singular if p > n.
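The bias described in the abstract can be reproduced with a small Monte-Carlo experiment. The sketch below is illustrative only and is not taken from the paper: the abstract does not state the correction, so the code assumes the standard inverse-Wishart result for Gaussian data with an estimated mean, in which the inverse of the unbiased sample covariance is rescaled by (n - p - 2)/(n - 1), valid for p < n - 2. The function names, the choice n = 20, p = 10, the identity true covariance, and the number of trials are all assumptions made for the example.

```python
import numpy as np


def inverse_covariance_estimates(samples):
    """Return (naive, corrected) inverse covariance estimates.

    samples: array of shape (n, p) holding n independent data vectors
    with p bins each. The correction factor is an assumption of this
    sketch (standard inverse-Wishart mean), not quoted from the abstract.
    """
    n, p = samples.shape
    if n - p - 2 <= 0:
        raise ValueError("the correction factor requires p < n - 2")
    # Sample covariance with the usual 1/(n-1) normalisation.
    cov = np.cov(samples, rowvar=False)
    naive = np.linalg.inv(cov)
    corrected = (n - p - 2) / (n - 1) * naive
    return naive, corrected


def mean_diagonal(n, p, n_trials=2000, seed=0):
    """Average diagonal element of both estimators over many realisations.

    The true covariance is the identity, so an unbiased estimator of the
    inverse covariance should average to 1 on the diagonal.
    """
    rng = np.random.default_rng(seed)
    naive_sum = 0.0
    corrected_sum = 0.0
    for _ in range(n_trials):
        samples = rng.standard_normal((n, p))
        naive, corrected = inverse_covariance_estimates(samples)
        naive_sum += np.trace(naive) / p
        corrected_sum += np.trace(corrected) / p
    return naive_sum / n_trials, corrected_sum / n_trials


naive_mean, corrected_mean = mean_diagonal(n=20, p=10)
print("naive inverse, mean diagonal:    ", naive_mean)
print("corrected inverse, mean diagonal:", corrected_mean)

# Singularity check: with fewer data sets than bins, the estimated
# covariance cannot have full rank and therefore cannot be inverted.
rng = np.random.default_rng(1)
few_samples = rng.standard_normal((5, 8))  # n = 5 data sets, p = 8 bins
rank = np.linalg.matrix_rank(np.cov(few_samples, rowvar=False))
print("rank of estimated covariance:", rank, "for p = 8")
```

Under these assumptions the naive inverse should average well above 1, which corresponds to confidence regions that are too small, while the rescaled estimator should average close to 1. The final check illustrates the singular case p > n.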
Related resources
Maximum likelihood spatiotemporal EEG/MEG source analysis
EEG/MEG noise has an unequal variance and is correlated, both in space and in time. Noise variance may differ greatly between samples or sensors, and correlations between samples or sensors can be very high [1-4]. If these noise characteristics are neglected, then an EEG/MEG source analysis will yield unreliable results [e.g. 5, 6]. First, source parameter estimates will be inefficient. That is...
An estimator of the inverse covariance matrix and its application to ML parameter estimation in dynamical systems
An exact formula of the inverse covariance matrix of an autoregressive stochastic process is obtained using the Gohberg-Semencul explicit inverse of the Toeplitz matrix. This formula is used to build an estimator of the inverse covariance matrix of a stochastic process based on a single realization. In this paper, we show that this estimator can be conveniently applied to maximum likelihood par...
A Newton Root-Finding Algorithm For Estimating the Regularization Parameter For Solving Ill-Conditioned Least Squares Problems
We discuss the solution of numerically ill-posed overdetermined systems of equations using Tikhonov a-priori-based regularization. When the noise distribution on the measured data is available to appropriately weight the fidelity term, and the regularization is assumed to be weighted by inverse covariance information on the model parameters, the underlying cost functional becomes a random varia...
Comparing Different Marker Densities and Various Reference Populations Using Pedigree-Marker Best Linear Unbiased Prediction (BLUP) Model
For genomic selection to be applied successfully, the reference population and marker density must be chosen properly. The purpose of this study was to investigate the accuracy of genomic estimated breeding values at low (5K), intermediate (50K) and high (777K) marker densities in simulated populations, under different scenarios for selecting the reference population. Af...
Regularization parameter estimation for underdetermined problems by the χ² principle with application to 2D focusing gravity inversion
Abstract. The χ²-principle generalizes the Morozov discrepancy principle to the augmented residual of the Tikhonov regularized least squares problem. For weighting of the data fidelity by a known Gaussian noise distribution on the measured data and, when the stabilizing, or regularization, term is considered to be weighted by unknown inverse covariance information on the model parameters, the mi...